Efficient classification of noisy speech using neural networks

Authors

Cathy Shao

Martin Bouchard

Abstract

Classifying active versus inactive speech in a noisy speech signal is an important part of many speech applications, typically in order to achieve a lower bit rate. In this work, the error rates for raw classification (i.e. with no hangover mechanism) of noisy speech obtained with traditional classification algorithms are compared to the rates obtained with Neural Network classifiers trained with different learning algorithms. The traditional classification algorithms used are the linear classifier, some Nearest Neighbor classifiers and the Quadratic Gaussian classifier. The training algorithms used for the Neural Network classifiers are the Extended Kalman Filter and the Levenberg-Marquardt algorithm. An evaluation of the computational complexity of the different classification algorithms is presented. Our noisy speech classification experiments show that Neural Network classifiers typically produce a more accurate and more robust classification than the traditional algorithms, while having a significantly lower computational complexity. Neural Network classifiers may therefore be a good choice for the core component of a noisy speech classifier, which would typically also include a hangover mechanism and possibly a speech enhancement algorithm.
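The abstract distinguishes raw per-frame classification from a classifier followed by a hangover mechanism. As a rough illustration of that distinction only, the sketch below shows one common form of hangover: after the last frame flagged as active, the decision is held active for a fixed number of additional frames. The function name `apply_hangover`, the parameter `hangover_frames` and the example frame sequence are assumptions made for illustration; they are not taken from the paper, which evaluates the raw decisions without this step.

```python
from typing import List


def apply_hangover(raw: List[bool], hangover_frames: int) -> List[bool]:
    """Hold the 'active' decision for up to `hangover_frames` frames
    after the last frame that the raw classifier marked as active.

    Hypothetical helper for illustration; not the authors' method.
    """
    smoothed = []
    remaining = 0  # hangover frames still available after the last active frame
    for active in raw:
        if active:
            remaining = hangover_frames  # reset the counter on every active frame
            smoothed.append(True)
        elif remaining > 0:
            remaining -= 1  # spend one hangover frame on an inactive raw decision
            smoothed.append(True)
        else:
            smoothed.append(False)
    return smoothed


# Raw per-frame decisions (True = active speech), invented for this example.
raw = [False, True, True, False, False, False, False, True, False]
print(apply_hangover(raw, 2))
# [False, True, True, True, True, False, False, True, True]
```

With `hangover_frames` set to 0 the function returns the raw decisions unchanged, which corresponds to the raw classification setting compared in the abstract.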

Similar resources

A Comparative Study of Gender and Age Classification in Speech Signals

Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...

An Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition

Convolutional Neural Networks (CNNs) have shown their performance in speech recognition systems for feature extraction and also for acoustic modeling. In addition, CNNs have been used for robust speech recognition, and competitive results have been reported. A Convolutive Bottleneck Network (CBN) is a kind of CNN which has a bottleneck layer among its fully connected layers. The bottleneck fea...

Classification and identification of geological facies using seismic data and competitive neural networks

Geological facies interpretation is essential for reservoir studies. Classifying and identifying seismic traces is a powerful approach for geological facies classification and distinction. The use of neural networks as classifiers is increasing in different sciences, such as seismic studies. They are computationally efficient and ideal for pattern identification. They can simply learn new algori...

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

A Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images

The convolutional neural network is one of the effective methods for classifying images; it performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noisy images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...

Publication date: 2003
